Feature extraction for speech recognition based on orthogonal acoustic-feature planes and LDA
نویسنده
چکیده
This paper describes an attempt to extract multiple topological structures. hidden in time-spectrum (TS) patterns, by using multiple mappmg operators, and to incorporate the operators into the feature extractor of a speech recognition system. In the previous work, the author proposed a novel feature-extraction method based on MAFP/KLT (MAFP: multiple acoustic-feature planes), in which 3x3 derivative filters were used for mappmg operators, and showed that the method achieved significant improvement in preliminary experiments. In this paper, firstly, the mapping operators are directly extracted in the form of a 3x3 orthogonal basis from a speech database. Next, the operators are evaluated, together with 3x3 simplified operators modeled on the orthogonal basis. Finally, after comparing the experimental results, the author proposes an effective feature-extraction method based on MAFHLDA. in which a Sobel filter is used for mapping operators.
منابع مشابه
A Database for Automatic Persian Speech Emotion Recognition: Collection, Processing and Evaluation
Abstract Recent developments in robotics automation have motivated researchers to improve the efficiency of interactive systems by making a natural man-machine interaction. Since speech is the most popular method of communication, recognizing human emotions from speech signal becomes a challenging research topic known as Speech Emotion Recognition (SER). In this study, we propose a Persian em...
متن کاملSupervised Feature Extraction of Face Images for Improvement of Recognition Accuracy
Dimensionality reduction methods transform or select a low dimensional feature space to efficiently represent the original high dimensional feature space of data. Feature reduction techniques are an important step in many pattern recognition problems in different fields especially in analyzing of high dimensional data. Hyperspectral images are acquired by remote sensors and human face images ar...
متن کاملA novel feature extraction using multiple acoustic feature planes for HMM-based speech recognition
This paper describes an attempt to extract multiple peripheral features of a point x(ti,fj) on a timespectrum (TS) pattern by observing n×n neighborhoods of the point, and to incorporate these peripheral features (MPFPs: multiple peripheral feature planes) into the feature extractor of a speech recognition system together with MFCC parameters. Two types of peripheral feature extractor, MPFP-KL ...
متن کاملPersian Phone Recognition Using Acoustic Landmarks and Neural Network-based variability compensation methods
Speech recognition is a subfield of artificial intelligence that develops technologies to convert speech utterance into transcription. So far, various methods such as hidden Markov models and artificial neural networks have been used to develop speech recognition systems. In most of these systems, the speech signal frames are processed uniformly, while the information is not evenly distributed ...
متن کاملComparative Analysis of Wavelet-based Feature Extraction for Intramuscular EMG Signal Decomposition
Background: Electromyographic (EMG) signal decomposition is the process by which an EMG signal is decomposed into its constituent motor unit potential trains (MUPTs). A major step in EMG decomposition is feature extraction in which each detected motor unit potential (MUP) is represented by a feature vector. As with any other pattern recognition system, feature extraction has a significant impac...
متن کامل